Context-aware semantic classification of search queries for browsing community question-answering archives
نویسندگان
چکیده
Community Question-Answering (cQA) platforms have become massive repositories of user-generated content. To a great extent, these archives have proven to be highly re-usable. For instance, web search engines profit from their best answers for enhancing user experience when resolving question-like queries. Hence, considerable research efforts have gone into trying to revitalize and retrieve past answers contained in these archives. However, similarly to traditional web search, there is a linguistic gap between cQA questions and question-like search queries that are utilized for fetching information from these cQA repositories (e.g., “rib pain after ovulation” and “iron oxide household”). In fact, this gap does not only consider linguistic features, but also structural and social attributes. On the one hand side, cQA questions are long-winded, they can bear a title and a body, and community members are compelled to categorize questions at posting time. On the other hand side, search queries come as an uncategorized short stream of words. Moreover, in juxtaposition to cQA question, users typically submit streaks of semantically related search queries, when attempting to fulfil their information needs. This work digs deep into effectively exploiting semantic cues, yielded by preceding queries within the same user session, for classifying question-like search queries into twenty-six semantic cQA question categories. In order to find significant discriminative properties, we carried out experiments on a large-scale dataset acquired automatically. Broadly speaking, our results indicate that more effective semantic features can be computed as long as we account for a larger number of previous queries. In particular, facilitating Explicit Semantic Analysis for modelling the query context shows to be extremely helpful for increasing the classification rate.
منابع مشابه
Automatically Generating Questions from Queries for Community-based Question Answering
This paper proposes a method that automatically generates questions from queries for community-based question answering (cQA) services. Our query-to-question generation model is built upon templates induced from search engine query logs. In detail, we first extract pairs of queries and user-clicked questions from query logs, with which we induce question generation templates. Then, when a new q...
متن کاملBuilding a Question Classifier for a TREC-Style Question Answering System
We define Question Classification (QC) here to be the task that, given a question, maps it to one of k classes, which provide a semantic constraint on the sought-after answer [Li02]. The topic of Question Classification arises in the area of automated question-answering systems, such as those created for the TREC question answering competition. Automated question-answering systems differ from o...
متن کاملModeling Community Question-Answering Archives
Community Question Answering (CQA) services contain large archives of previously asked questions and their answers. We present a statistical topic model for modeling Question-Answering archives. The model explicitly captures relationships between questions and their answers by modeling topical dependencies. We show that the model achieves improved performance in retrieving the correct answer fo...
متن کاملAquaLog: An Ontology-Portable Question Answering System for the Semantic Web
As semantic markup becomes ubiquitous, it will become important to be able to ask queries and obtain answers, using natural language (NL) expressions, rather than the keyword-based retrieval mechanisms used by the current search engines. AquaLog is a portable question-answering system which takes queries expressed in natural language and an ontology as input and returns answers drawn from the a...
متن کاملLearning to Rank Questions for Community Question Answering with Ranking SVM
This paper presents our method to retrieve relevant queries given a new question in the context of Discovery Challenge: Learning to Re-Ranking Questions for Community Question Answering competition. In order to do that, a set of learning to rank methods was investigated to select an appropriate method. The selected method was optimized on training data by using a search strategy. After optimizi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Knowl.-Based Syst.
دوره 96 شماره
صفحات -
تاریخ انتشار 2016